AMPLIFICATION ARGUMENTS FOR LARGE SIEVE INEQUALITIES 



E. KOWALSKI 



Abstract. We give a new proof of the arithmetic large sieve inequality based on an am- 
plification argument, and use a similar method to prove a new sieve inequality for classical 
holomorphic cusp forms. A sample application of the latter is also given. 



1. The classical large sieve 

The classical arithmetic large sieve inequality states that, for any real numbers N, Q ^ 1, 
any choice of subsets Q p C Z/pZ for primes p ^ Q, we have 

(1) \{n < iV | n (modp) £ Vt p for p < Q}\ ^ A 



H 



where 



"=En 



\a p \ 



q<Q p\q P 

and A is any constant for which the "harmonic" large sieve inequality holds: for any complex 
numbers (gO, we have 



"n } i 

p) EE'|E-(y) 2 ^E 



a 



2 

n| ; 



b * 

the notation ^ and ^ denoting, respectively, a sum over squarefree integers, and one over 

integers coprime with the (implicit) modulus, which is q here. 

By work of Montgomery- Vaughan and Selberg, it is known that one can take 

A = Q 2 - 1 + iV 

(see, e.g., Th. 7.7]). 

There are a number of derivations of ([1]) from fl2]); for one of the earliest, see [1C)| Ch. 3]. 
The most commonly used is probably the argument of Gallagher involving a "submultiplica- 
tive" property of some arithmetic function (see, e.g., [SJ §2.2] for a very general version). 
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We will show in this note how to prove (pQ) quite straightforwardly from the dual version 
of the harmonic large sieve inequality: A is also any constant for which 



(3) E|E E* /3(g,a)e(-)|^A^ |/%a)| 2 , 

nsgiV q^Q a (mod q) ^ ?<Q a (mod q) 

holds for arbitrary complex numbers (/3(q, a)). This is of some interest because, quite often£| 
the inequality © is proved by duality from ([3]), and because, in recent generalized versions of 
the large sieve (see [8]), it often seems that the analogue of (J3J) is the most natural inequality 
to prove - or least, the most easily accessible. So, in some sense, one could dispense entirely 
with 02]) for many applications! In particular, note that both known proofs of the optimal 
version with A = N — 1 + Q 2 proceed by duality 

Note that some ingredients of many previous proofs occur in this new argument. Also, 
there are other proofs of (pQ) working directly from the inequality ([3]) which can be found 
in the older literature on the large sieve, usually with explicit connections with the Selberg 
sieve (see the references to papers of Huxley, Kobayashi, Matthews and Motohashi in [Til p. 
561]), although none of those that the author has seen seems to give an argument which is 
exactly identical or as well motivated. Also, traces of this argument appear earlier in some 
situations involving modular forms, e.g., in In Section [2J we will use the same method 
to obtain a new type of sieve inequality for modular forms; in that case, it doesn't seem 
possible to adapt easily the classical proofs. 

Indeed, maybe the most interesting aspect of our proof is that it is very easy to motivate. 
It flows very nicely from an attempt to improve the earlier inequality 

(4) \{n <: iV I n(modp) £ % for p <: Q}\ < — , K = V , 

of Renyi, which is most easily proved using (j3J) instead of (JTJ) , as in [U §2.4]. 

We will explain this quite leisurely; one could be much more concise and direct (as in 
Section [2]). 

Let 

S = {n ^ N | n (modp) £ Q p for p ^ Q}, 

be the sifted set; we wish to estimate from above the cardinality of this finite set. From ([3]), 
the idea is to find an "amplifier" of those integers remaining in the sifted set, i.e., an expres- 
sion of the form 



q^iQ a (mod q) 



which is large (in some sense) when n 6 S. Then an estimate for \S\ follows from the usual 
Chebychev-type manoeuvre. 

To construct the amplifier A(n), we look first at a single prime p ^ Q. If n £ S, we 
have n (modp) ^ Q p . If we expand the characteristic function of Q p in terms of additive 



But not always - Gallagher's very short proof, found e.g. in [TTJ Th. 1, p. 549], proceeds directly. 
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characters]! we have then 







W n ) = E «fe«)e(y), a(p,a) = - E la p (x)e^- 

a (modp) ' xGZ/pZ ' 



and the point is that the contribution of the constant function (0-th harmonic) is, indeed, 
relatively "large" , because it is 

V 

and exactly reflects the probability of a random element being in Q p . Thus for n (modp) ^ 
Q p , we have 

(5) / 3 (P' a ) e (y) = c p 

a (modp) 

with 

= |^p[ ^ ^ _ _ a (p a y 
V 

If we only use the contribution of the primes in ([3]), and the amplifier 

A (n) = E E PipMf)' 

pKQ a (modp) 

then by we get 

Ei^wi 2 ^Ei a wi 2 ^ a E E i^ G )i 2 - 

neS n^N psSQa(modp) 

For n & S, the size of the amplifier is 

K-)| 2 = |E E P(p,a)e(™)\ 2 =\Yc P \ 2 = K\ 

by (131), while on the other hand, by applying the Parseval identity in Z/pZ, we get 

E E i/Kp.«)i a = £(^ E iin»i 2 -«M) 2N 

p^Qa(modp) p<Q xgZ/pZ 

= ^c p (l-c p )^K. 

So we obtain 

if 2 |S| < AK, 

i.e., exactly Renyi's inequality @, by this technique. 

To go further, we must exploit all the squarefree integers q ^ Q (and not only the primes) 
to construct the amplifier. This is most easily described using the Chinese Remainder The- 
orem to write 

Z/qZ ~ J] Z/pZ, (Z/qZ) x ~ J] (Z/pZ) >< , 

pk pk 



2 We use this specific basis to use ([3]) , but any orthonormal basis containing the constant function 1 would 
do the job, as in [8]. 



and putting together the amplifiers modulo primes p | q: if n G S then n (modp) ^ Q p for 
all p | g, and hence multiplying out ([5]) over p \ q, we find constants /3(g, a) G C, defined for 
(a, g) = 1 (because (3(p, a) is defined for a coprime with p), such that 

yi /%>°) e (y) = n> 

a(modq) p|<3 

Moreover, because the product decomposition of the Chinese Remainder Theorem is com- 
patible with the Hilbert space structure involved, we have 

e* i/3(^)i 2 = n e* i^fe^i^n^ 1 -^)- 

a(mod<j) P mod g a ( mo dp) p\q 

Arguing as before, we obtain from ([3]) - using all squarefree moduli q ^ Q this time - that 

A 

with 

^E'rN 1 -^ b = YU 

q^Q p\q q<:Q p\q 

This is not quite ([T]), but we have some flexibility to choose another amplifier, namely, 
notice that this expression is not homogeneous if we multiply the coefficients (3(q,a) by 
scalars independent of a, and we can use this to find a better inequality. Precisely, let 



(6) |5|<A- 2 . 



Cp . 



p\q 

where £ p are arbitrary real coefficients. 

Then we have the new amplification property 

a(mod<j) ' p\q 

with altered "cost" given by 

a(modg) p\q 

so that, arguing as before, we get 
with 

^ = y! ruw 1 - ^ Bi = E b n w 

<?s?<2 pI<? P\q 
By homogeneity, the problem is now to minimize a quadratic form (namely A\) under a 
linear constraint given by B\. This is classical, and is done by Cauchy's inequality: writing 



Y[c p , p 9 =jj(i-c p ), ^=n^ 

p|g p\q p\q 



for ease of notation, we have 

c, 



= (E «*) < (E (E |) = 

g^Q q^Q i 

with equality if and only if £ p is proportional to 



t 1 P 

Sp 



X p I I 

in which case 

P 1 1 ip I 

and we get A X = B X = H, hence \S\ < Aif -1 , which is (JTJ). 

Remarks. (1) The last optimization step is reminiscent of the Selberg sieve (see, e.g., [7J p. 
161, 162]). Indeed, it is well known that the Selberg sieve is related to the large sieve, and 
particularly with the dual inequality d2J), as explained in [5j p. 125]. Note however that 
the coefficients we optimize for, being of an "amplificatory" nature, and different from the 
coefficents typically sought for in Selberg's sieve, which are akin to the Mobius function 
and of a "mollificatory" nature. 

(2) The argument does not use any particular feature of the classical sieve, and thus 
extends immediately to provide a proof of the general large sieve inequality of [HI Prop. 
2.3] which is directly based on the dual inequality [HI Lemma 2.8]; readers interested in the 
formalism of [H] are encouraged to check this. 

Example. What are the amplifiers above in some simple situations? In the case - maybe 
the most important - where we try to count primes, we then take fl p = {0} to detect integers 
free of small primes by sieving, and (jSJ) becomes 

i (mod 

if p \ n. Then, for q squarefree, the associated detector is the identity 

^ q V q J g' 

a (mod q) 

if (n, q) — 1, or in other words, it amounts to the well-known formula 



py V p y p 

a (modp) 



E <0=m 



q 

a (mod q) 



for the values of a Ramanujan sum with coprime arguments. Note that in this case, the 
optimization process above replaced c p = ^ with 



p — 1 

which is not a very big change - and indeed, for small sieves, the bound (J6J) is not far from (JT| 
and remains of the right order of magnitude. 
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On the other hand, for an example in a large sieve situation, we can take tt p to be the set 
of squares in Z/pZ. The characteristic function (for odd p) is 

ln p (ar)= Yl r ^' a ) e (y) 

a (modp) 

with coefficients given - essentially - by Gauss sums 

£' if))- 

x (modp) 

Then c p tends to 1/2 as p — > +00, while £ p c p tends to 1. This difference leads to a 
discrepancy in the order of magnitude of the final estimate: using standard results on bounds 
for sums of multiplicative functions, (j6]) and taking Q = \/N, we get 

\S\ < v/iVOogiV) 1 / 4 , 

instead of \S\ <C \/N that follows from (JTJ. 



2. Sieving for modular forms 

To illustrate the possible usefulness of the proof given in the first section, we use the same 
technique to prove a new type of large sieve inequality for classical (holomorphic) modular 
forms. The originality consists in using known inequalities for Fourier coefficients (due to 
Deshouillers-Iwaniec) as a tool to obtain a sieve where the cusp forms are the objects of 
interest, i.e., to bound from above the number of cusp forms of a certain type satisfying 
certain local conditions. 

Let k ^ 2 be a fixed even integer. For any integer q ^ 1, let Sk(q)* be the finite set of 
primitive holomorphic modular forms of level q and weight k, with trivial nebentypus (more 
general settings can be studied, but we restrict to this one for simplicity). We denote by 

f(z) = ^y fe - 1)/2 A/(n)e(nz) 

the Fourier expansion of a form / e Sk{q)* at the cusp at infinity. 
We consider on this finite set the "measure" fj, — p, q defined by 



(te)*-i</,/)' 

where (•, •) is the Petersson inner product. This is the familiar "harmonic weight", and we 
denote 

(7) E q (a)= J2 ^ is true) = £ M? ({/}) 

V(f) is true 

the corresponding averaging operator and "probability", for an arbitrary property V{f) 
referring to the modular forms / G Sk{q)* ■ (Note that it is only asymptotically that this is 
a probability measure, as q — > +00). 
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Imitating the notation in [8j Ch. 1] , we now denote by 



Pp 



S k (q)* R 
/ ^ A/(p), 



the p-th Fourier coefficient maps, which we see as giving "global-to-local" data, similar to 
reduction maps modulo primes for integers. If d ^ 1 is a squarefree integer coprime with q, 
we denote 

/ ^ (Pp(f)) P \d = (A/(p)) P |d, 



which we emphasize is a tup/e of Fourier coefficients, that should not be mistaken with the 
single number A/(tf). 

The basic relation with sieve is the following idea: provided Q is small enough, the 
{Pp{f))p^Q become equidistributed as q — > +00 for the product Sato-Tate measure 



where 



P^Q 



1 / t 2 

PST = -l[_ 2 ,2](*)y 1 - J dt, 

and this is similar to the equidistribution of arithmetic sequences like the integers or the 
primes modulo squarefree d, and the independence due to the Chinese Remainder Theorem. 

The quantitative meaning of this principle is easy to describe if Q is bounded (indepen- 
dently of q), but requires some care when it grows with q. For our purpose, we express it as 
given by uniform bounds for Weyl-type sums associated with a suitable orthonormal basis 
of L 2 (vq). The latter is easy to construct. Indeed, recall first the standard fact that the 
Chebychev polynomials X m , m ^ 0, defined by 

(8) X m (2c 0s9) = 3in ( (m+ fl 1 W ee|0 ,4 

sin v 

form an orthonormal basis of L 2 (fisr)- Then standard arguments show that for Q ^ 2 and 
Uq the measure above on [—2,2]"'^, the functions 

A d (x) = l[X mp (x p ), for all x = (x p ) p G [-2,2]^, 
p 

defined for any Q-friable integeiH d ^ 1, factored as 

P^Q 

form an orthonormal basis of L 2 {vq). (In particular we have Ai = I, the constant function 
1.) 



3 I.e., integer only divisible by primes Q. 



We have also the following fact which gives the link between this orthonormal basis and 
our local data (p p ) p ' for any integer m ^ 1 coprime with q and divisible only by primes 
p ^ Q, and any / G Sk(q)*, we have 

(9) A m (p d (f)) = A/(m), where d = JJp. 

This is simply a reformulation of the Hecke multiplicativity relations between Fourier 
coefficients of primitive forms. 

Remark. Our situation is similar to that of classical sieve problems, where (in the framework 
of [5]) we have a set X (with a finite measure /i) and surjective maps X Yg with finite 
target sets Yg, each equipped with a probability density z^, so that the equidistribution can 
be measured by the size of the remainders re(y) defined by 

and the independence by using finite sets m = {£ 1: . . . , £ k }, and 

y m = Y[ Y ^ P m = II Pt : X ~* Y ™ u m{yi, ■ ■ ■ , Vk) = v tl {yi) -"Vt k (yk), 
and looking at 

Kpmiv)) = v{X)v m {y) + r m (y). 

Here the compact set [—2, 2] requires the use of infinitely many functions to describe an 
orthonormal basis. Another (less striking) difference is that our local information lies in the 
same set [—2, 2] for all primes, whereas classical sieves typically involve reduction modulo 
primes, which lie in different sets. 

We now state the analogue, in this language, of the dual large sieve inequality 

Proposition 1. With notation as above, for all Q ^ 1, all integers N ^ 1, all complex 
numbers a(m) defined for m in the set ty q (N,Q) of Q -friable integers ^ iV coprime with q, 
we have 

(10) E q (\ J2 «(m)A m (p d (/))| 2 ) «(l + iVg- 1 )^|a(m)| 2 , 

m&S! q (N,Q) m 

where the implied constant depends only on k and d on the left-hand side is the radical Y\. p \ m P- 

Proof of Proposition Ql This is in fact simply a consequence of one of the well-known large 
sieve inequalities for Fourier coefficients of cusp forms (as developped by Iwaniec and by 
Deshouillers-Iwaniec, see j3]). The point is that because of (J2J), the left-hand side of flTUJ) 
can be rewritten 

S= Yl Pi(if})\ Yl Oi(m)X f (m) 



V ; /es fc (g)* me*,(JV,Q) 
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We can now enlarge this by positivity; remarking that 

| / e S k (q)*} 



can be seen as a subset of an orthonormal basis of the space Sk(q) of cusp forms of weight 
k and level q, and selecting any such basis Bk, q D Sk(q)*, we have therefore 

where we put a(m) = if m ^ ^(-W, Q), and where the A^(m) are the Fourier coefficients, 
so that 

(p(z) = y^m (fc ~ 1)/2 A y (m)e(n2:), 

(as earlier for Hecke forms). Now by the large sieve inequality in Theorem 7.26], taking 
into account the slightly different normalization^ we have 



(11) ^T^T E | E «(m)A,(m)| 2 «(l + iVg- 1 )^| a (m)| 2 

with an absolute implied constant, and this leads to f lTUj) . □ 

Remark 2. In terms of equidistribution (which are hidden in this proof), the basic statement 
for an individual prime p is that 

lim E q (X m (p p (f))) = 0, 

for all m ^ 1. Such results are quite well-known and follow in this case from the Petersson 
formula. There is an implicit version already present in Bruggeman's work (see [U §4] , where 
it is shown that, on average, "most" Maass forms with Laplace eigenvalue ^ T, satisfy the 
Ramanuj an- Petersson conjecture), and the first explicit result goes back to Sarnak [13j, 
still in the case of Maass formsJl Serre [13] and Conrey, Duke and Farmer [2] gave similar 
statements for holomorphic forms, and Royer [12] described quantitative versions in that 
case. 

We can now derive the analogues of the arithmetic inequality (TjQ) and of Renyi's inequal- 
ity (j3J). The basic "sieve" questions we look at is to bound from above the cardinality (or 
rather, /Zg-measure) of sets of the type 

S = {feS k (qy I \ f (p) = Pp (f)^n p ioT P ^Q, p\q}, 

for Q p C [—2,2]. Because the expansion of the characteristic function of Q p in terms of 
Chebychev polynomials involves infinitely many terms, we restrict to a simple type of con- 
dition sets Q p of the following type: 

(12) n p = {xe [-2, 2] | Y p (x) ^ PtO - 5 P }, 
where 

Y p = p Pi o + p Pt iXi + • • • + l3 PyS X s 



4 The case k = 2 requires adding a factor log N. 

5 This is the only result we know that discusses the issue of independence of the coefficients at various 
primes. 
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is a real- valued polynomial and 5 P > (the degree s is assumed to be the same for all p). 
Note that (5 P fi is the /i5T-average of Y p , so our sets S are those where the Fourier coefficients 
for p ^ Q are "away" from the putative average value according to the Sato- Tate measure. 
Denote also by 

-2 , r 2 . 2 



°l = ft,* = / Y P d ^ST ~ U Y p dflST J ■ 

l<i<s J 2 J 2 



the variance of Y p . 

Then the analogue of fll]) is 

(is) E q ((j2(Y P (*f(p))-p P> o)) 2 ) «(i + gv 1 )E< 

where the implied constant depends only on k, and that of ([I]) is 

(14) P q (Y p (\ f (p)) < /3 P)0 - 5 P for all p ^ q) < (1 + N'q- 1 )^ 1 
where <5 P > 0, A/" ^ 1 is arbitrary and 

m&^ q (N,Q) p\m P 

the implied constant depending again only on k. 

To prove ( 1131) . we apply (TTOT) with N = Q s and a(m) = unless m = with 1 ^ j ^ s 
and p ^ Q, p \ q, in which case 

a(p>) = (3 pJ . 
By definition of Y p (x) and of A^, we get 

J2 a(m)A m ( Pd (f)) = E ( Y p(Pp(f)) ~ W> 

me* g (JV,Q) psSQ 

showing that (fTgj) is indeed a special case of ffTUl) . 

To prove ( fl4|) . we use the "amplification" method of the previous section. The basic 
observation is that if, for some prime p, we have 

(15) Y p (\ f (p))^p 0!P -6 p , 
then it follows that 

(-MX l (\ f (p))^5>0. 
Now let £ p , for p ^ Q, be arbitrary auxiliary positive real numbers, and let 

for <i | P(Q), the product of all primes p ^ Q. If (|T5l) holds for all p ^ Q coprime with g, 
then we find by multiplying out that, for any integer m 6 ^f g (N, Q), i.e., such that 

(16) d^N, d\P(Q), (d,q) = l, d = Pl ---p k , (say), 
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and for such we have 

&E ■ ■ E (-^a** • • • Pp>ju*hfrf(tf)) ■ -XiuiWt)) > ^ {d \ 

which translates to 

& ^ a(m)A TO ((A/(p))^) > fdJI^' 

mCSd p\d 

where m runs over the set Sd of integers of the type 

m = T\p Vp{n \ with l<v p (n)<s, JJp < N, 

p\d p\m 

so m ^ iV s , and 

ar(m) = JJ (-/3p,„ p ( m ))- 

Thus, summing over d subject to (TT5|) . squaring, then averaging over / and applying (ITUj) . 
we find that the probability 

V = P q (Y p (X f (p)) < - <5 P for all p < q) 

satisfies 

p«(i + ivv 1 )^, 

where 

^=E^Ei a ( Tn )i a =E^n°j fi i=E^n^ 

d rn£Sd d p\d d p\d 

Cauchy's inequality shows that Bf ^ HAi, with equality if 

£ P = -f , for all p^Q, 

and the inequality above, with this choice, leads to 

V < (1 + N'q-^H' 1 , 

as desired. 

Remark. If one tries to adapt, for instance, the standard proof in [8], one encounters problems 
because the latter would (naively at least) involve the problematic expansion of a Dirac 
measure at a fixed x G [—2, 2] in terms of Chebychev polynomials. 

Here is an easy application of f fl4|) . for illustration (stronger results for that particular 
problem follow from the inequality of Lau and Wu [9], as will be explained with other related 
results in a forthcoming joint work): it is well-known that for / G Sk{q)*, the sequence of 
real numbers {\f(p)) p changes sign infinitely often, and there has been some recent interest 
(see, e.g., the paper [6] of Iwaniec, Kohnen and Sengupta) in giving quantitative bounds on 
the first sign change. We try instead to show that this first sign-change is quite small on 
average over / (compare with jl]): fix A > 0, and let 

S q , A = {fE S k (q)* I A/(p) < for all p < (logg) A } 

ll 



(any other combination of signs is permissible). This is a "sifted set", and we claim that 

l^,A|«g 1/2+1/(2A)+£ 

for any e > 0, where the implied constant depends only on k and e. Since Sk{q)* is of size 
about q (for fixed k), this is a non-trivial bound for all A > 1. Moreover, to prove this 
bound, it suffices to show 

P q (S q ,A) « 9 - 1/2+1/(2A)+£ 

since we have the well-known upper bound yU g ({/}) g~ 1_e for any e > (see, e.g., p. 
138]). 

The sets fl p =]0, 2] used in S Qt A are not exactly in the form f[T2"j) . so we use some smoothing: 
we claim there exists a real polynomial Y of degree s = 2 such that 



(17) 



Y(x) < sgn(x), for all sc G [-2,2], and /3 = J Ydfi S T > -1- 



Assuming such a polynomial is given, we observe that 

A/(p) ^ =*► Y(A/(p)) < -1 = A) - 
for some fixed 5 > 0. Therefore, by (JT3J) with iV = g 1 / 5 , we get for all Q that 

P q (X f (p) ^ for p ^ Q) < iJ- 1 

where 

and the implied constant depends only on k. By assumption, we have 7 > 0, and an easy 
lower bound for H follows in the range of interest simply from bounding 7 w ( m ) ^> m~ e and 
using known results on the cardinality of ^ q (y, (logg) A ): we have 

™e* 9 (g 1/s ,(logg) A ) 

for any e > 0, the implied constant depending only on A, the choice of Y and e. This clearly 
gives the result, and it only remains to exhibit the polynomial Y. One can check easily that 

Y(x) = ~X Q (x) + + \x 2 {x) = "I + f + x 

does the job (see its graph); the numerical values of /3 , 6 and 7 are given by 



Remark 3. See the letter of Serre in the Appendix of [15] for previous examples showing how 
to use limited information towards the Sato-Tate conjecture to prove distribution results for 
Hecke eigenvalues (of a fixed modular form). 
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